b94f0b3c03b0652f94d15d6dba47e8d127b49c71
s444498 prod test another repo algo
- challenge
- "He Said She Said" classification challenge (2nd edition)
- submitter
- [anonymized]
- submitted
- 2023-11-03 12:15:41.829443 UTC
- file basename
- out
dev-0 / 77a487ba232f2c111e6436b3082d3dc643442e2d
Metric | Score |
---|---|
Likelihood | 0.00000 |
Accuracy | 0.51690 |
Likelihood | Accuracy | |
---|---|---|
+H | 0.00000 | 0.51000 |
+C | 0.00000 | 0.00000 |
-C | 0.00000 | 0.51691 |
worst items
note: the gold standard is taken from the submission itself, not from the challenge data!# | input | expected output | actual output | dev-1 Likelihood +C |
---|---|---|---|---|
1 | Cierpiałem na straszne lagi – kilkanaście sekund lub dłużej czarnego ekranu przy próbie przełączenia się / uruchomienia prawie każdej aplika… | 1 | 0 | 0.00000 |